Abstract

For $k \ge 3$ let $A \subset [1,N]$ be a set not containing a solution to
$a_1 x_1 + \dots + a_k x_k = a_1 x_{k+1} + \dots + a_k x_{2k}$ in distinct integers. We prove that
there is an $\varepsilon > 0$ depending on the coefficients of the equation such that every such
$A$ has $O(N^{1/2-\varepsilon})$ elements. This answers a question of I. Ruzsa.

Introduction 

The problem of estimating the size of a largest subset of $[1,N]$ not containing a
solution to a given linear equation arises very frequently in combinatorial number
theory. For example, the sets with no non-trivial solutions to $x_1 + x_2 - 2x_3 = 0$
and $x_1 + x_2 = x_3 + x_4$ are sets with no arithmetic progressions of length three, and
Sidon sets respectively. For several of the more prominent equations, like the two
equations above, there are large bodies of results that deal with the structure and the
size of solution-free sets. The first systematic study of general linear equations was
undertaken by Ruzsa [Ruz93, Ruz95]. To ascribe a precise meaning to the concept of
a "set with no non-trivial solution" he introduced two definitions of a trivial solution.
One of them is that a solution is non-trivial if all the variables are assigned different
values.

For a fixed linear equation denote by $R(N)$ the size of a largest set of integers
in $[1,N]$ with no solution to the equation in distinct integers. Ruzsa [Ruz93] showed
that if $k \ge 2$, then for the symmetric equation

\[ a_1 x_1 + \dots + a_k x_k = a_1 x_{k+1} + \dots + a_k x_{2k} \tag{1} \]

one has $R(N) = O(N^{1/2})$. For $k = 2$, the estimate is tight for the Sidon equation.
Ruzsa gave the example of the equation

\[ x_1 + d(x_2 + \dots + x_k) = x_{k+1} + d(x_{k+2} + \dots + x_{2k}) \tag{2} \]

and the set $A$ of integers in $[1,N]$ whose development in the base $d^2 k$ consists only
of digits $0, \dots, d-1$. Since addition of elements of $A$ in (2) involves no carries in
base $d^2 k$, for every solution to (2) with elements of $A$ one necessarily has that $x_1$ and
$x_{k+1}$ are equal digit by digit. Thus $x_1 = x_{k+1}$, implying that $A$ contains no solution
to (2) in distinct integers. Since $|A| = \Omega(N^{1/2 - c/\log d})$, this example shows that there
is no $\varepsilon > 0$ such that the estimate $R(N) = O(N^{1/2-\varepsilon})$ holds for all equations of the
form (1). However, Ruzsa asked whether for $k \ge 3$ there is an $\varepsilon > 0$ depending on
the coefficients of (1) such that $R(N) = O(N^{1/2-\varepsilon})$. We answer in the affirmative.
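Ruzsa's digit construction can be checked by brute force on a toy instance. The following is a minimal sketch, not part of the original argument: the function names and the parameters $d = 2$, $k = 3$ are illustrative choices, and the number 0 is discarded so that the set stays inside $[1,N]$.

```python
from itertools import product

def ruzsa_digit_set(d, k, num_digits):
    """Positive integers below (d*d*k)**num_digits whose base-(d*d*k) digits all lie in {0, ..., d-1}."""
    base = d * d * k
    values = {
        sum(digit * base**pos for pos, digit in enumerate(digits))
        for digits in product(range(d), repeat=num_digits)
    }
    values.discard(0)  # keep the set inside [1, N]
    return sorted(values)

def solutions_of_equation_2(A, d, k):
    """Yield every 2k-tuple from A with x1 + d(x2+...+xk) = x_{k+1} + d(x_{k+2}+...+x_{2k})."""
    for xs in product(A, repeat=2 * k):
        left = xs[0] + d * sum(xs[1:k])
        right = xs[k] + d * sum(xs[k + 1:])
        if left == right:
            yield xs

# Toy instance: d = 2, k = 3, so the base is d^2 k = 12 and the allowed digits are 0 and 1.
A = ruzsa_digit_set(d=2, k=3, num_digits=2)
# Solutions in which x_1 and x_{k+1} differ; the no-carry argument says there are none.
bad = [xs for xs in solutions_of_equation_2(A, d=2, k=3) if xs[0] != xs[3]]
print(A, len(bad))
```

Every solution found has $x_1 = x_{k+1}$, so none of them uses $2k$ distinct integers, in line with the no-carry argument.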

Theorem 1. Let $\|a\|_1 = \sum_{i=1}^{k} |a_i|$. If $k \ge 3$ and $a_i \ne 0$ for $1 \le i \le k$, then

\[ R(N) = O\bigl(N^{\frac{1}{2} - \frac{1}{c(k)\|a\|_1}}\bigr). \]

To explain the idea behind the proof of theorem 1 we examine the proof that
$R(N) = O(N^{1/2})$. Let $a_{k+i} = -a_i$. Then the equation (1) can be written as
$\sum a_i x_i = 0$. Assume that $A \subset [1,N]$ has $M$ elements and contains only trivial
solutions to (1). Let $r(m)$ denote the number of solutions to

\[ a_1 x_1 + \dots + a_k x_k = m, \qquad x_i \in A. \]

Then $E = \sum_m r(m)^2$ is the total number of solutions to (1) in $A$. Since there are at
most $\|a\|_1 N$ values of $m$ for which $r(m) > 0$, by Cauchy-Schwarz it follows that

\[ E = \sum_m r(m)^2 \ge \frac{M^{2k}}{\|a\|_1 N}. \tag{3} \]
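The lower bound (3) is easy to test numerically. The sketch below is an illustration only: the set `A` and the positive coefficients `coeffs` are arbitrary choices, and the helper names are not from the paper. It computes $r(m)$ by enumeration and compares $E$ with $M^{2k}/(\|a\|_1 N)$.

```python
from collections import Counter
from itertools import product

def representation_counts(A, coeffs):
    """r(m): number of tuples (x_1, ..., x_k) in A^k with a_1 x_1 + ... + a_k x_k = m."""
    return Counter(
        sum(a * x for a, x in zip(coeffs, xs))
        for xs in product(A, repeat=len(coeffs))
    )

def energy(A, coeffs):
    """E = sum over m of r(m)^2, the number of solutions of the symmetric equation in A."""
    r = representation_counts(A, coeffs)
    return sum(count * count for count in r.values())

N = 30
A = [1, 2, 5, 11, 19, 30]   # arbitrary illustrative subset of [1, N]
coeffs = (1, 2, 3)          # illustrative positive coefficients, k = 3
M, k = len(A), len(coeffs)

E = energy(A, coeffs)
lower_bound = M ** (2 * k) / (sum(coeffs) * N)   # M^{2k} / (||a||_1 N)
print(E, lower_bound, E >= lower_bound)
```

With positive coefficients every value $m$ lies in $[\|a\|_1, \|a\|_1 N]$, which is where the count of at most $\|a\|_1 N$ values comes from.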

The next step is to bound the number of solutions with $x_i = x_j$ from above. Let
$1 \le r \le 2k$ be any index other than $i$ or $j$. Since

\[ x_r = -\frac{1}{a_r}\Bigl( \sum_{l \ne i,j,r} a_l x_l + (a_i + a_j) x_i \Bigr), \tag{4} \]

it follows that equation (1) uniquely determines $x_r$ in terms of other $x$'s. Thus at
most one in every $M$ assignments of variables results in a solution to (1). Since there
are $\binom{2k}{2}$ choices of $i$ and $j$,

\[ E \le \binom{2k}{2} \frac{1}{M} M^{2k-1}. \tag{5} \]

The bound $|A| = O(N^{1/2})$ follows by comparison with (3).
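Spelled out, the comparison of (3) and (5) is the following chain. It uses only the two displayed bounds, with $M = |A|$:

```latex
\[
\frac{M^{2k}}{\|a\|_1 N} \;\le\; E \;\le\; \binom{2k}{2} M^{2k-2}
\quad\Longrightarrow\quad
M^2 \;\le\; \binom{2k}{2}\,\|a\|_1\,N
\quad\Longrightarrow\quad
M \;\le\; \Bigl(\binom{2k}{2}\,\|a\|_1\,N\Bigr)^{1/2} = O\bigl(N^{1/2}\bigr).
\]
```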

In order for this argument to be tight the linear form on the right side of (4)
should often take values in $A$. Heuristically it corresponds to the sumset
$\sum_{l \ne i,j,r} a_l \cdot A + (a_i + a_j) \cdot A$ being not much larger than $A$ itself, where
$t \cdot A = \{ta : a \in A\}$ is the $t$-dilate of $A$. Plünnecke-Ruzsa inequalities tell us that
if some sumset involving $A$ is not much larger than $A$, then every sumset involving only
$A$ is not much larger than $A$. In particular we expect $\sum_{l \le k} a_l \cdot A$ to be of size
only $\sim M$ as opposed to $\sim N$, which was assumed in the derivation of the lower bound (3).
Therefore, we expect that if (5) is close to being tight, then (3) is not, and vice versa.
The remainder of the paper is a rigorous justification of the heuristic argument above.



Proof 

This section is organized as follows. First we state the tools from additive
combinatorics that we need. Then we introduce the notation that is going to be used in the
proof of theorem 1. Finally, after a couple of preliminary lemmas the theorem 1 is
proved.
Lemma 2 (Ruzsa's triangle inequality [TV06, lemma 2.6]). For any finite $A, B, C \subset \mathbb{Z}$
we have

\[ |A - C| \le \frac{|A - B|\,|B - C|}{|B|}. \]

Lemma 3 (Plünnecke's inequality [TV06, corollary 6.26]). For any finite sets $A, B \subset \mathbb{Z}$,
if $|A + B| \le K|A|$, then $|kB| \le K^k |A|$.

We will also use the hypergraph version of the Balog-Szemerédi-Gowers theorem due
to Sudakov, Szemerédi and Vu [SSV05]. Let $A_1, \dots, A_k$ be sets of integers. If $H$ is a
subset of $A_1 \times \dots \times A_k$, then $\sum_H A_i$ is the collection of all sums $a_1 + \dots + a_k$, where
$(a_1, \dots, a_k) \in H$.

Lemma 4 ([SSV05], theorem 4.3). For any integer $k \ge 1$ there are positive-valued
polynomials $f_k(x,y)$ and $g_k(x,y)$ such that the following holds. Let $n, C, K$ be positive
numbers. If $A_1, \dots, A_k$ are sets of $n$ positive integers, $H \subset A_1 \times \dots \times A_k$ with
$|H| \ge n^k/K$ and $|\sum_H A_i| \le Cn$, then one can find subsets $A'_i \subset A_i$ such that

\[ |A'_i| \ge \frac{n}{f_k(C,K)} \quad \text{for all } 1 \le i \le k, \]

\[ |A'_1 + \dots + A'_k| \le g_k(C,K)\, n. \]

From the heuristic argument above it is clear that we will need to count the number of
solutions of various equations. It is therefore advantageous to introduce appropriate
notation. Let $r(A_1, \dots, A_k; m)$ denote the number of solutions to

\[ a_1 + \dots + a_k = m, \qquad a_i \in A_i. \]

Then $E(A_1, \dots, A_k; B_1, \dots, B_l) = \sum_m r(A_1, \dots, A_k; m)\, r(B_1, \dots, B_l; m)$ counts the
number of solutions to

\[ a_1 + \dots + a_k = b_1 + \dots + b_l, \qquad a_i \in A_i,\ b_j \in B_j. \]

We write $E(A_1, \dots, A_k)$ to denote $E(A_1, \dots, A_k; A_1, \dots, A_k)$.
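The notation translates directly into a few lines of code. The following sketch is illustrative only: the function names `r`, `E` and `dilate` mirror the symbols above but are otherwise assumptions. It computes $r$ and $E$ by enumeration and compares $E$ with a direct count of solutions on a toy input.

```python
from collections import Counter
from itertools import product

def r(sets):
    """Counter mapping m to r(A_1, ..., A_k; m), the number of tuples summing to m."""
    return Counter(sum(t) for t in product(*sets))

def E(sets_a, sets_b=None):
    """E(A_1..A_k; B_1..B_l) = sum_m r(A; m) r(B; m); with one argument, E(A_1..A_k)."""
    if sets_b is None:
        sets_b = sets_a
    ra, rb = r(sets_a), r(sets_b)
    return sum(count * rb[m] for m, count in ra.items())

def dilate(t, A):
    """The t-dilate t . A = {t a : a in A}."""
    return [t * a for a in A]

A1, A2, B1 = [1, 2, 3], [2, 4], [5, 6, 10]
# Direct count of solutions to a1 + a2 = b1 for comparison.
direct = sum(1 for a1 in A1 for a2 in A2 for b1 in B1 if a1 + a2 == b1)
print(E([A1, A2], [B1]), direct)
```

For instance, the quantity $E(a_1 \cdot A, \dots, a_k \cdot A)$ that appears later would be `E([dilate(a, A) for a in coeffs])` in this sketch.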

Lemma 5. There is a positive-valued polynomial $h_k(x)$ such that if $A_1, \dots, A_k$ are
$k$ sets of integers with $n$ elements each, and $E(A_1, \dots, A_k) \ge c n^{2k-1}$, then there are
subsets $A'_i \subset A_i$ such that

\[ |A'_i| \ge h_k(c)\, n \quad \text{for all } 1 \le i \le k, \]

\[ |A'_1 + \dots + A'_k| \le \frac{n}{h_k(c)}. \]



Proof. For brevity of notation write $E = E(A_1, \dots, A_k)$ and $r(m) = r(A_1, \dots, A_k; m)$.
Let $S = \{m : r(m) \ge c n^{k-1}/2\}$, and $H = \{(a_1, \dots, a_k) \in A_1 \times \dots \times A_k : a_1 + \dots + a_k \in S\}$.
Note that $\sum_H A_i = S$ and

\[ |S| \le \frac{n^k}{c n^{k-1}/2} = \frac{2}{c}\, n. \]

Since $r(m) \le n^{k-1}$ for every $m$, and $\sum_{m \notin S} r(m) = n^k - |H|$, it follows that

\[ E = \sum_{m \in S} r(m)^2 + \sum_{m \notin S} r(m)^2 \le |H|\, n^{k-1} + (n^k - |H|)\, c n^{k-1}/2. \]

Therefore $|H| \ge c n^k/2$, and we deduce from lemma 4 the existence of the subsets $A'_i$
with the desired properties. □
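The two counting facts used in this proof, $|S| \le 2n/c$ and $|H| \ge c n^k/2$, can be checked on a small example. The sketch below is an illustration under stated assumptions: the three test sets are arbitrary, the function name is hypothetical, and $c$ is taken to be exactly $E/n^{2k-1}$, the largest value allowed by the hypothesis. Exact rational arithmetic avoids rounding issues in the comparisons.

```python
from collections import Counter
from fractions import Fraction
from itertools import product

def popular_sums(sets):
    """Return (c, S, H): c = E / n^(2k-1), S the popular sums, H the tuples whose sum lies in S."""
    n, k = len(sets[0]), len(sets)
    r = Counter(sum(t) for t in product(*sets))
    energy = sum(v * v for v in r.values())
    c = Fraction(energy, n ** (2 * k - 1))   # largest c with E >= c n^(2k-1)
    threshold = c * n ** (k - 1) / 2         # popularity threshold c n^(k-1) / 2
    S = {m for m, v in r.items() if v >= threshold}
    H = [t for t in product(*sets) if sum(t) in S]
    return c, S, H

# Three arbitrary 8-element sets of positive integers (k = 3, n = 8).
sets = [list(range(1, 9)), [3 * x for x in range(1, 9)], [2 ** j for j in range(8)]]
c, S, H = popular_sums(sets)
n, k = 8, 3
print(len(S) <= 2 * n / c, len(H) >= c * n ** k / 2)
```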

Lemma 6. For every $k \ge 2$ there is a positive-valued polynomial $p_k(x)$ such that if
$t_1, \dots, t_k$ are any $k$ positive integers, and $A \subset \mathbb{Z}$ is an $n$-element set satisfying
$E(t_1 \cdot A, \dots, t_k \cdot A) \ge c n^{2k-1}$, then for every $l$-tuple of positive integers $s_1, \dots, s_l$
we have $E(s_1 \cdot A, \dots, s_l \cdot A) \ge p_k(c)^{\|s\|_1} n^{2l-1}$.

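Proof. Apply the lemma above with $A_i = t_i \cdot A$ to obtain $A'_i$. Ruzsa's triangle
inequality applied to $-A'_1$ and $A'_2 + \dots + A'_k$ yields that

\[ |A'_1 - A'_1| \le \frac{|A'_1 + \dots + A'_k|^2}{|A'_2 + \dots + A'_k|} \le \frac{n}{h_k(c)^3}. \]

Set $\tilde{A} = (1/t_1) \cdot A'_1$. Then $\tilde{A}$ is a subset of $A$ satisfying $|\tilde{A}| \ge h_k(c)\, n$ and
$|\tilde{A} - \tilde{A}| \le n/h_k(c)^3$. Plünnecke's inequality then implies that
$\bigl|\|s\|_1 \tilde{A}\bigr| \le h_k(c)^{-3\|s\|_1} n$. The inclusion $s_1 \cdot \tilde{A} + \dots + s_l \cdot \tilde{A} \subset \|s\|_1 \tilde{A}$ together with
the Cauchy-Schwarz inequality yield

\[
\begin{aligned}
E(s_1 \cdot A, \dots, s_l \cdot A) &\ge E(s_1 \cdot \tilde{A}, \dots, s_l \cdot \tilde{A}) \\
&\ge \frac{|\tilde{A}|^{2l}}{|s_1 \cdot \tilde{A} + \dots + s_l \cdot \tilde{A}|} \\
&\ge \frac{h_k(c)^{2l} n^{2l}}{\bigl|\|s\|_1 \tilde{A}\bigr|} \\
&\ge h_k(c)^{3\|s\|_1 + 2l}\, n^{2l-1}.
\end{aligned}
\]

Since $s_1, \dots, s_l$ are positive integers, $\|s\|_1 \ge l$, and the lemma follows. □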
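Two elementary steps in the proof above lend themselves to a quick check: the inclusion of a sum of dilates in the iterated sumset, and the Cauchy-Schwarz bound $E \ge |A|^{2l}/|s_1 \cdot A + \dots + s_l \cdot A|$. The sketch below is illustrative only: the set `A` and the tuple `s` are arbitrary choices and the helper names are hypothetical.

```python
from collections import Counter
from itertools import product

def dilate(t, A):
    """The t-dilate t . A = {t a : a in A}."""
    return {t * a for a in A}

def sumset(*sets):
    """A_1 + ... + A_l, the set of all sums with one summand from each set."""
    return {sum(t) for t in product(*sets)}

def iterated_sumset(A, m):
    """mA = A + ... + A with m summands."""
    result = {0}
    for _ in range(m):
        result = {x + a for x in result for a in A}
    return result

def energy(sets):
    """E(A_1, ..., A_l) = sum over m of r(m)^2."""
    r = Counter(sum(t) for t in product(*sets))
    return sum(v * v for v in r.values())

A = {1, 2, 4, 7, 11}   # arbitrary illustrative set
s = (1, 2, 3)          # arbitrary positive integers, l = 3, ||s||_1 = 6
dilates = [dilate(t, A) for t in s]
S = sumset(*dilates)

print(S <= iterated_sumset(A, sum(s)))                 # inclusion in ||s||_1 A
print(energy(dilates) * len(S) >= len(A) ** (2 * len(s)))   # Cauchy-Schwarz
```

The inclusion holds because $s_i a$ is a sum of $s_i$ copies of $a$; the sketch does not test Plünnecke's inequality itself.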

Proof of theorem 1. Without loss of generality we may assume that $a_1, \dots, a_k$ are
positive. Rewrite equation (1) as

\[ a_1 x_1 + \dots + a_{2k} x_{2k} = 0, \tag{6} \]

where $a_{k+i} = -a_i$. Let $A \subset [1,N]$ be an $M$-element set containing only trivial
solutions to (6). The number of solutions to equation (6) with variables in $A$ is
$E = E(a_1 \cdot A, \dots, a_k \cdot A)$. Let $T_{ij}$ be the number of solutions with $x_i = x_j$. By the
pigeonhole principle for at least one pair $i \ne j$ we have $T_{ij} \ge E/\binom{2k}{2}$. Fix that pair.
Next we partition the index set $\{1, \dots, 2k\} \setminus \{i,j\}$ arbitrarily into sets
$I_1 = \{l_{1,1}, \dots, l_{1,k-1}\}$ and $I_2 = \{l_{2,1}, \dots, l_{2,k-1}\}$ with $k-1$ elements each. For convenience
write

\[ r_1(m) = r(a_{l_{1,1}} \cdot A, \dots, a_{l_{1,k-1}} \cdot A; m) \]

and $E_1 = \sum_m r_1(m)^2$. Define $r_2$ and $E_2$ analogously with respect to $I_2$. Then by
Cauchy-Schwarz

\[ T_{ij} = \sum_{x \in A} \sum_m r_1(m)\, r_2\bigl(-(a_i + a_j)x - m\bigr) \le \sum_{x \in A} E_1^{1/2} E_2^{1/2} = M E_1^{1/2} E_2^{1/2}. \]

Either $E_1$ or $E_2$ is at least $E/\bigl(\binom{2k}{2} M\bigr)$. We can assume it is true of $E_1$. Then by
lemma 6 above we have

\[ E \ge p_{k-1}\Bigl(\frac{E_1}{M^{2k-3}}\Bigr)^{\|a\|_1} M^{2k-1}. \]

If $\deg p_{k-1} = d$, then we obtain that

\[ E = O\bigl(M^{2k-2-1/(d\|a\|_1 - 1)}\bigr). \]

Since $E \ge M^{2k}/(\|a\|_1 N)$, the theorem follows. □
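For the record, the final comparison can be unpacked as follows. The shorthand $\delta = 1/(d\|a\|_1 - 1)$ is introduced only for this computation, and the implied constants depend on $k$ and on the coefficients:

```latex
\[
\frac{M^{2k}}{\|a\|_1 N} \;\le\; E = O\bigl(M^{2k-2-\delta}\bigr)
\;\Longrightarrow\; M^{2+\delta} = O(N)
\;\Longrightarrow\; M = O\bigl(N^{\frac{1}{2+\delta}}\bigr)
= O\bigl(N^{\frac{1}{2} - \frac{\delta}{4+2\delta}}\bigr),
\qquad
\frac{\delta}{4+2\delta} = \frac{1}{4d\|a\|_1 - 2}.
\]
```

Since $1/(4d\|a\|_1 - 2) \ge 1/(4d\|a\|_1)$, this reading is consistent with the exponent in theorem 1 for the choice $c(k) = 4d$.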



Conclusion 

Theorem 1 gives the estimate $R(N) = O(N^{\frac{1}{2} - \frac{1}{r}})$ with $r = c(k)\|a\|_1$. Ruzsa's example
shows that no estimate better than $r = c(k) \log \|a\|_1$ can be true.

After this paper was written, a Plünnecke-type estimate on sums of dilates of the
form $s_1 \cdot A + \dots + s_l \cdot A$, which appears in the proof of lemma 6, was established in
[Buk07]. Whereas Plünnecke's inequality and the inclusion $s_1 \cdot A + \dots + s_l \cdot A \subset \|s\|_1 A$
yield the exponent of $\|s\|_1$, the new inequality yields the exponent $C \log \|s\|_1$ for an
absolute constant $C$. Plugging that into the proof of lemma 6 one obtains that
theorem 1 is valid with $r = c(k) \log \|a\|_1$, which is sharp in view of Ruzsa's example.

Acknowledgement. I thank Benjamin Sudakov for stimulating discussions. 